Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 586672 |
| Missing cells | 71 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 89.5 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 12 |
id has a high cardinality: 586672 distinct values | High cardinality |
name has a high cardinality: 446474 distinct values | High cardinality |
artists has a high cardinality: 114030 distinct values | High cardinality |
id_artists has a high cardinality: 115062 distinct values | High cardinality |
release_date has a high cardinality: 19700 distinct values | High cardinality |
danceability is highly overall correlated with valence | High correlation |
energy is highly overall correlated with loudness and 1 other fields | High correlation |
loudness is highly overall correlated with energy and 1 other fields | High correlation |
acousticness is highly overall correlated with energy and 1 other fields | High correlation |
valence is highly overall correlated with danceability | High correlation |
explicit is highly imbalanced (73.9%) | Imbalance |
time_signature is highly imbalanced (68.6%) | Imbalance |
id is uniformly distributed | Uniform |
id has unique values | Unique |
popularity has 44690 (7.6%) zeros | Zeros |
key has 74950 (12.8%) zeros | Zeros |
instrumentalness has 205083 (35.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-01-14 13:28:39.868136 |
|---|---|
| Analysis finished | 2023-01-14 13:29:36.777666 |
| Duration | 56.91 seconds |
| Software version | pandas-profiling vv3.6.1 |
| Download configuration | config.json |
id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 586672 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 35iwgR4jXetI318WEWsa1Q | 1 |
|---|---|
| 6cHlho8Qe04uAIa1hd6efJ | 1 |
| 1AL2EDY1U2dLL0WqQGtNu0 | 1 |
| 4vsj6KApKrZnQnF76Zve2u | 1 |
| 5D0srsR8tggP6mLAdBn8d9 | 1 |
| Other values (586667) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 12906784 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 586672 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 35iwgR4jXetI318WEWsa1Q |
|---|---|
| 2nd row | 021ht4sdgPcrDgSk7JTbKY |
| 3rd row | 07A5yehtSnoedViJAZkNnc |
| 4th row | 08FmqUhxtyLTn6pAh6bk45 |
| 5th row | 08y9GfoqCWfOGsKdwojr5e |
Common Values
| Value | Count | Frequency (%) |
| 35iwgR4jXetI318WEWsa1Q | 1 | < 0.1% |
| 6cHlho8Qe04uAIa1hd6efJ | 1 | < 0.1% |
| 1AL2EDY1U2dLL0WqQGtNu0 | 1 | < 0.1% |
| 4vsj6KApKrZnQnF76Zve2u | 1 | < 0.1% |
| 5D0srsR8tggP6mLAdBn8d9 | 1 | < 0.1% |
| 6GbE5GD4xCcnJvpjsasjiB | 1 | < 0.1% |
| 3RFI8uU3RQt4QJoluTYPdm | 1 | < 0.1% |
| 1Bw4w65vm06L97nwvj0JdO | 1 | < 0.1% |
| 5CjJtWtT5CIE4QhgHAdhSm | 1 | < 0.1% |
| 7LXQvp92lpBY2w878B1b0v | 1 | < 0.1% |
| Other values (586662) | 586662 |
Length
| Value | Count | Frequency (%) |
| 35iwgr4jxeti318wewsa1q | 1 | < 0.1% |
| 0brxjhrngq3w4v9frnsfhu | 1 | < 0.1% |
| 0grxu6gkvncvmjbsea0uhe | 1 | < 0.1% |
| 2u7t2vcrlxkp69um0mdes2 | 1 | < 0.1% |
| 0igi1ucz84pyevetnl1lgp | 1 | < 0.1% |
| 07a5yehtsnoedvijazknnc | 1 | < 0.1% |
| 08fmquhxtyltn6pah6bk45 | 1 | < 0.1% |
| 08y9gfoqcwfogskdwojr5e | 1 | < 0.1% |
| 0dd9imxtatgwsmsad69kzt | 1 | < 0.1% |
| 1klkkacg16o5crqpiaf1tz | 1 | < 0.1% |
| Other values (586662) | 586662 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 278639 | 2.2% |
| 1 | 275299 | 2.1% |
| 2 | 274969 | 2.1% |
| 4 | 274047 | 2.1% |
| 3 | 273494 | 2.1% |
| 5 | 272518 | 2.1% |
| 6 | 271098 | 2.1% |
| 7 | 256601 | 2.0% |
| s | 200017 | 1.5% |
| y | 199765 | 1.5% |
| Other values (52) | 10330337 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5174821 | |
| Uppercase Letter | 5157371 | |
| Decimal Number | 2574592 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 200017 | 3.9% |
| y | 199765 | 3.9% |
| e | 199651 | 3.9% |
| i | 199593 | 3.9% |
| t | 199436 | 3.9% |
| w | 199423 | 3.9% |
| r | 199363 | 3.9% |
| v | 199352 | 3.9% |
| k | 199288 | 3.9% |
| h | 199229 | 3.8% |
| Other values (16) | 3179704 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 199714 | 3.9% |
| C | 199535 | 3.9% |
| M | 199508 | 3.9% |
| F | 199310 | 3.9% |
| B | 199193 | 3.9% |
| J | 199101 | 3.9% |
| H | 199084 | 3.9% |
| L | 198897 | 3.9% |
| K | 198842 | 3.9% |
| D | 198680 | 3.9% |
| Other values (16) | 3165507 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 278639 | |
| 1 | 275299 | |
| 2 | 274969 | |
| 4 | 274047 | |
| 3 | 273494 | |
| 5 | 272518 | |
| 6 | 271098 | |
| 7 | 256601 | |
| 9 | 199538 | |
| 8 | 198389 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10332192 | |
| Common | 2574592 | 19.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 200017 | 1.9% |
| y | 199765 | 1.9% |
| A | 199714 | 1.9% |
| e | 199651 | 1.9% |
| i | 199593 | 1.9% |
| C | 199535 | 1.9% |
| M | 199508 | 1.9% |
| t | 199436 | 1.9% |
| w | 199423 | 1.9% |
| r | 199363 | 1.9% |
| Other values (42) | 8336187 |
Common
| Value | Count | Frequency (%) |
| 0 | 278639 | |
| 1 | 275299 | |
| 2 | 274969 | |
| 4 | 274047 | |
| 3 | 273494 | |
| 5 | 272518 | |
| 6 | 271098 | |
| 7 | 256601 | |
| 9 | 199538 | |
| 8 | 198389 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12906784 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 278639 | 2.2% |
| 1 | 275299 | 2.1% |
| 2 | 274969 | 2.1% |
| 4 | 274047 | 2.1% |
| 3 | 273494 | 2.1% |
| 5 | 272518 | 2.1% |
| 6 | 271098 | 2.1% |
| 7 | 256601 | 2.0% |
| s | 200017 | 1.5% |
| y | 199765 | 1.5% |
| Other values (52) | 10330337 |
name
Categorical
| Distinct | 446474 |
|---|---|
| Distinct (%) | 76.1% |
| Missing | 71 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
| Summertime | 101 |
|---|---|
| Intro | 92 |
| Year 3000 | 91 |
| Hold On | 87 |
| 2000 Years | 76 |
| Other values (446469) |
Length
| Max length | 529 |
|---|---|
| Median length | 242 |
| Mean length | 20.243994 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11875147 |
|---|---|
| Distinct characters | 4678 |
| Distinct categories | 22 ? |
| Distinct scripts | 17 ? |
| Distinct blocks | 34 ? |
Unique
| Unique | 376093 ? |
|---|---|
| Unique (%) | 64.1% |
Sample
| 1st row | Carve |
|---|---|
| 2nd row | Capítulo 2.16 - Banquero Anarquista |
| 3rd row | Vivo para Quererte - Remasterizado |
| 4th row | El Prisionero - Remasterizado |
| 5th row | Lady of the Evening |
Common Values
| Value | Count | Frequency (%) |
| Summertime | 101 | < 0.1% |
| Intro | 92 | < 0.1% |
| Year 3000 | 91 | < 0.1% |
| Hold On | 87 | < 0.1% |
| 2000 Years | 76 | < 0.1% |
| Home | 74 | < 0.1% |
| Baby | 72 | < 0.1% |
| Angel | 68 | < 0.1% |
| Stay | 68 | < 0.1% |
| Forever | 65 | < 0.1% |
| Other values (446464) | 585807 | |
| (Missing) | 71 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| 118721 | 5.3% | |
| the | 39475 | 1.8% |
| in | 21286 | 1.0% |
| a | 21150 | 1.0% |
| i | 18791 | 0.8% |
| de | 17376 | 0.8% |
| you | 17318 | 0.8% |
| me | 15448 | 0.7% |
| of | 14574 | 0.7% |
| no | 14157 | 0.6% |
| Other values (236150) | 1923628 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1635323 | 13.8% | |
| e | 985846 | 8.3% |
| a | 831482 | 7.0% |
| o | 622983 | 5.2% |
| i | 610106 | 5.1% |
| n | 553982 | 4.7% |
| r | 519226 | 4.4% |
| t | 445938 | 3.8% |
| l | 378543 | 3.2% |
| s | 361331 | 3.0% |
| Other values (4668) | 4930387 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7471048 | |
| Uppercase Letter | 1731546 | 14.6% |
| Space Separator | 1635323 | 13.8% |
| Other Letter | 297495 | 2.5% |
| Decimal Number | 281270 | 2.4% |
| Other Punctuation | 222317 | 1.9% |
| Dash Punctuation | 117593 | 1.0% |
| Close Punctuation | 47899 | 0.4% |
| Open Punctuation | 47843 | 0.4% |
| Nonspacing Mark | 16872 | 0.1% |
| Other values (12) | 5941 | 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| י | 10097 | 3.4% |
| ו | 7873 | 2.6% |
| ה | 6906 | 2.3% |
| ל | 6228 | 2.1% |
| א | 4922 | 1.7% |
| ר | 4652 | 1.6% |
| า | 4594 | 1.5% |
| น | 4527 | 1.5% |
| ב | 4203 | 1.4% |
| อ | 4167 | 1.4% |
| Other values (4110) | 239326 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 985846 | |
| a | 831482 | |
| o | 622983 | 8.3% |
| i | 610106 | 8.2% |
| n | 553982 | 7.4% |
| r | 519226 | 6.9% |
| t | 445938 | 6.0% |
| l | 378543 | 5.1% |
| s | 361331 | 4.8% |
| u | 276252 | 3.7% |
| Other values (190) | 1885359 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 135434 | 7.8% |
| M | 134651 | 7.8% |
| T | 132102 | 7.6% |
| A | 112973 | 6.5% |
| L | 96399 | 5.6% |
| D | 90687 | 5.2% |
| B | 86180 | 5.0% |
| C | 85044 | 4.9% |
| R | 82416 | 4.8% |
| I | 80670 | 4.7% |
| Other values (152) | 694990 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ่ | 3609 | |
| ั | 3300 | |
| ้ | 2910 | |
| ี | 1811 | |
| ิ | 1296 | 7.7% |
| ื | 832 | 4.9% |
| ู | 686 | 4.1% |
| ุ | 620 | 3.7% |
| ็ | 526 | 3.1% |
| ์ | 360 | 2.1% |
| Other values (27) | 922 | 5.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 57370 | |
| , | 47649 | |
| ' | 43395 | |
| : | 24603 | |
| " | 17199 | 7.7% |
| / | 12174 | 5.5% |
| & | 5684 | 2.6% |
| ! | 5147 | 2.3% |
| ? | 4310 | 1.9% |
| ; | 1479 | 0.7% |
| Other values (21) | 3307 | 1.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 53 | |
| ☆ | 33 | |
| ★ | 24 | |
| № | 12 | 7.0% |
| ® | 12 | 7.0% |
| ♡ | 7 | 4.1% |
| � | 3 | 1.8% |
| △ | 3 | 1.8% |
| ○ | 3 | 1.8% |
| ◑ | 3 | 1.8% |
| Other values (13) | 18 | 10.5% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 418 | |
| + | 273 | |
| | | 112 | 11.2% |
| = | 63 | 6.3% |
| > | 52 | 5.2% |
| < | 44 | 4.4% |
| × | 12 | 1.2% |
| → | 10 | 1.0% |
| ∞ | 5 | 0.5% |
| ↑ | 5 | 0.5% |
| Other values (9) | 10 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 63921 | |
| 1 | 53122 | |
| 2 | 50631 | |
| 9 | 20960 | 7.5% |
| 3 | 19856 | 7.1% |
| 4 | 16338 | 5.8% |
| 5 | 15855 | 5.6% |
| 8 | 13580 | 4.8% |
| 6 | 13561 | 4.8% |
| 7 | 13441 | 4.8% |
| Other values (3) | 5 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 45124 | |
| ] | 2375 | 5.0% |
| 」 | 196 | 0.4% |
| 》 | 164 | 0.3% |
| 』 | 17 | < 0.1% |
| 】 | 12 | < 0.1% |
| } | 5 | < 0.1% |
| 〉 | 4 | < 0.1% |
| ⧽ | 1 | < 0.1% |
| ༻ | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 45056 | |
| [ | 2379 | 5.0% |
| 「 | 196 | 0.4% |
| 《 | 164 | 0.3% |
| 『 | 17 | < 0.1% |
| 【 | 12 | < 0.1% |
| „ | 11 | < 0.1% |
| 〈 | 4 | < 0.1% |
| { | 3 | < 0.1% |
| ⧼ | 1 | < 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 2825 | |
| ๆ | 81 | 2.7% |
| 々 | 55 | 1.9% |
| ゝ | 4 | 0.1% |
| ˈ | 2 | 0.1% |
| ʻ | 1 | < 0.1% |
| ˇ | 1 | < 0.1% |
| ˋ | 1 | < 0.1% |
| ـ | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 117055 | |
| 〜 | 375 | 0.3% |
| – | 115 | 0.1% |
| — | 18 | < 0.1% |
| ‐ | 16 | < 0.1% |
| ― | 14 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 168 | |
| ` | 77 | |
| ^ | 2 | 0.8% |
| ¨ | 1 | 0.4% |
| ˙ | 1 | 0.4% |
| ΄ | 1 | 0.4% |
Private Use
| Value | Count | Frequency (%) |
| | 4 | |
| | 2 | |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 179 | |
| € | 7 | 3.6% |
| £ | 3 | 1.6% |
| ¥ | 3 | 1.6% |
| ¢ | 1 | 0.5% |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 11 | |
| Ⅲ | 5 | |
| Ⅰ | 3 | 14.3% |
| Ⅴ | 1 | 4.8% |
| Ⅳ | 1 | 4.8% |
Control
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 | |
| | 1 | |
| | 1 | |
| | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 693 | |
| ” | 183 | 19.7% |
| » | 54 | 5.8% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 162 | |
| « | 54 | 21.8% |
| ‘ | 32 | 12.9% |
Format
| Value | Count | Frequency (%) |
| | 7 | |
| | 3 | |
| | 1 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1635323 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9097167 | |
| Common | 2357988 | 19.9% |
| Han | 103052 | 0.9% |
| Cyrillic | 98175 | 0.8% |
| Hebrew | 80512 | 0.7% |
| Thai | 77395 | 0.7% |
| Katakana | 24826 | 0.2% |
| Hiragana | 24662 | 0.2% |
| Greek | 7349 | 0.1% |
| Arabic | 2217 | < 0.1% |
| Other values (7) | 1804 | < 0.1% |
Most frequent character per script
Han
| Value | Count | Frequency (%) |
| 的 | 2841 | 2.8% |
| 愛 | 2115 | 2.1% |
| 我 | 1986 | 1.9% |
| 你 | 1870 | 1.8% |
| 人 | 1421 | 1.4% |
| 一 | 1366 | 1.3% |
| 不 | 1176 | 1.1% |
| 情 | 1150 | 1.1% |
| 心 | 949 | 0.9% |
| 曲 | 939 | 0.9% |
| Other values (3432) | 87239 |
Hangul
| Value | Count | Frequency (%) |
| 이 | 32 | 2.6% |
| 아 | 26 | 2.2% |
| 사 | 26 | 2.2% |
| 랑 | 23 | 1.9% |
| 가 | 22 | 1.8% |
| 하 | 22 | 1.8% |
| 지 | 20 | 1.7% |
| 고 | 19 | 1.6% |
| 리 | 19 | 1.6% |
| 나 | 19 | 1.6% |
| Other values (324) | 981 |
Latin
| Value | Count | Frequency (%) |
| e | 985846 | 10.8% |
| a | 831482 | 9.1% |
| o | 622983 | 6.8% |
| i | 610106 | 6.7% |
| n | 553982 | 6.1% |
| r | 519226 | 5.7% |
| t | 445938 | 4.9% |
| l | 378543 | 4.2% |
| s | 361331 | 4.0% |
| u | 276252 | 3.0% |
| Other values (213) | 3511478 |
Common
| Value | Count | Frequency (%) |
| 1635323 | ||
| - | 117055 | 5.0% |
| 0 | 63921 | 2.7% |
| . | 57370 | 2.4% |
| 1 | 53122 | 2.3% |
| 2 | 50631 | 2.1% |
| , | 47649 | 2.0% |
| ) | 45124 | 1.9% |
| ( | 45056 | 1.9% |
| ' | 43395 | 1.8% |
| Other values (127) | 199342 | 8.5% |
Katakana
| Value | Count | Frequency (%) |
| ン | 2066 | 8.3% |
| イ | 1257 | 5.1% |
| ラ | 1084 | 4.4% |
| ス | 1056 | 4.3% |
| ル | 977 | 3.9% |
| ト | 912 | 3.7% |
| リ | 778 | 3.1% |
| ッ | 675 | 2.7% |
| ア | 597 | 2.4% |
| マ | 572 | 2.3% |
| Other values (74) | 14852 |
Hiragana
| Value | Count | Frequency (%) |
| の | 3311 | 13.4% |
| い | 1715 | 7.0% |
| な | 1197 | 4.9% |
| た | 948 | 3.8% |
| に | 895 | 3.6% |
| し | 746 | 3.0% |
| て | 746 | 3.0% |
| と | 740 | 3.0% |
| り | 656 | 2.7% |
| ら | 626 | 2.5% |
| Other values (70) | 13082 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 9712 | 9.9% |
| о | 8175 | 8.3% |
| е | 7528 | 7.7% |
| и | 5574 | 5.7% |
| н | 5527 | 5.6% |
| т | 4935 | 5.0% |
| р | 4330 | 4.4% |
| л | 4227 | 4.3% |
| с | 3976 | 4.0% |
| к | 3307 | 3.4% |
| Other values (62) | 40884 |
Thai
| Value | Count | Frequency (%) |
| า | 4594 | 5.9% |
| น | 4527 | 5.8% |
| อ | 4167 | 5.4% |
| ก | 3661 | 4.7% |
| ่ | 3609 | 4.7% |
| เ | 3608 | 4.7% |
| ร | 3527 | 4.6% |
| ั | 3300 | 4.3% |
| ง | 3050 | 3.9% |
| ้ | 2910 | 3.8% |
| Other values (58) | 40442 |
Greek
| Value | Count | Frequency (%) |
| α | 772 | 10.5% |
| ο | 535 | 7.3% |
| ι | 509 | 6.9% |
| τ | 420 | 5.7% |
| ν | 404 | 5.5% |
| ρ | 319 | 4.3% |
| ε | 313 | 4.3% |
| μ | 291 | 4.0% |
| λ | 285 | 3.9% |
| ά | 270 | 3.7% |
| Other values (56) | 3231 |
Arabic
| Value | Count | Frequency (%) |
| ا | 335 | |
| ل | 242 | 10.9% |
| ي | 220 | 9.9% |
| ن | 129 | 5.8% |
| م | 126 | 5.7% |
| ب | 124 | 5.6% |
| و | 112 | 5.1% |
| ر | 100 | 4.5% |
| ه | 72 | 3.2% |
| ح | 71 | 3.2% |
| Other values (28) | 686 |
Bopomofo
| Value | Count | Frequency (%) |
| ㄚ | 6 | 12.8% |
| ㄘ | 2 | 4.3% |
| ㄨ | 2 | 4.3% |
| ㄞ | 2 | 4.3% |
| ㄟ | 2 | 4.3% |
| ㄧ | 2 | 4.3% |
| ㄎ | 1 | 2.1% |
| ㄍ | 1 | 2.1% |
| ㄈ | 1 | 2.1% |
| ㄐ | 1 | 2.1% |
| Other values (27) | 27 |
Hebrew
| Value | Count | Frequency (%) |
| י | 10097 | |
| ו | 7873 | 9.8% |
| ה | 6906 | 8.6% |
| ל | 6228 | 7.7% |
| א | 4922 | 6.1% |
| ר | 4652 | 5.8% |
| ב | 4203 | 5.2% |
| ת | 3934 | 4.9% |
| ש | 3735 | 4.6% |
| מ | 3684 | 4.6% |
| Other values (20) | 24278 |
Lao
| Value | Count | Frequency (%) |
| ນ | 3 | 7.1% |
| ່ | 3 | 7.1% |
| ເ | 2 | 4.8% |
| ມ | 2 | 4.8% |
| ຍ | 2 | 4.8% |
| ອ | 2 | 4.8% |
| ້ | 2 | 4.8% |
| າ | 2 | 4.8% |
| ົ | 2 | 4.8% |
| ັ | 2 | 4.8% |
| Other values (18) | 20 |
Inherited
| Value | Count | Frequency (%) |
| ゙ | 205 | |
| ́ | 101 | |
| ゚ | 45 | 9.7% |
| ̈ | 30 | 6.4% |
| ̃ | 27 | 5.8% |
| ̆ | 17 | 3.6% |
| ̊ | 15 | 3.2% |
| ̧ | 11 | 2.4% |
| ̂ | 10 | 2.1% |
| ̀ | 3 | 0.6% |
| Other values (2) | 2 | 0.4% |
Tibetan
| Value | Count | Frequency (%) |
| ྷ | 1 | |
| ུ | 1 | |
| ཆ | 1 | |
| ེ | 1 | |
| མ | 1 | |
| ལ | 1 | |
| ཨ | 1 | |
| ྨ | 1 | |
| ར | 1 | |
| ཀ | 1 | |
| Other values (2) | 2 |
Georgian
| Value | Count | Frequency (%) |
| ა | 6 | |
| ე | 2 | 11.1% |
| ნ | 2 | 11.1% |
| მ | 2 | 11.1% |
| ი | 2 | 11.1% |
| ტ | 1 | 5.6% |
| დ | 1 | 5.6% |
| ზ | 1 | 5.6% |
| თ | 1 | 5.6% |
Unknown
| Value | Count | Frequency (%) |
| | 4 | |
| | 2 | |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11320559 | |
| None | 136036 | 1.1% |
| CJK | 102995 | 0.9% |
| Cyrillic | 98175 | 0.8% |
| Hebrew | 80512 | 0.7% |
| Thai | 77395 | 0.7% |
| Katakana | 29058 | 0.2% |
| Hiragana | 24912 | 0.2% |
| Arabic | 2218 | < 0.1% |
| Punctuation | 1515 | < 0.1% |
| Other values (24) | 1772 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1635323 | 14.4% | |
| e | 985846 | 8.7% |
| a | 831482 | 7.3% |
| o | 622983 | 5.5% |
| i | 610106 | 5.4% |
| n | 553982 | 4.9% |
| r | 519226 | 4.6% |
| t | 445938 | 3.9% |
| l | 378543 | 3.3% |
| s | 361331 | 3.2% |
| Other values (85) | 4375799 |
None
| Value | Count | Frequency (%) |
| é | 11484 | 8.4% |
| ä | 11361 | 8.4% |
| á | 10970 | 8.1% |
| í | 8657 | 6.4% |
| ó | 8354 | 6.1% |
| ö | 7107 | 5.2% |
| ı | 6723 | 4.9% |
| ü | 6314 | 4.6% |
| å | 4343 | 3.2% |
| ñ | 3452 | 2.5% |
| Other values (243) | 57271 |
Hebrew
| Value | Count | Frequency (%) |
| י | 10097 | |
| ו | 7873 | 9.8% |
| ה | 6906 | 8.6% |
| ל | 6228 | 7.7% |
| א | 4922 | 6.1% |
| ר | 4652 | 5.8% |
| ב | 4203 | 5.2% |
| ת | 3934 | 4.9% |
| ש | 3735 | 4.6% |
| מ | 3684 | 4.6% |
| Other values (20) | 24278 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 9712 | 9.9% |
| о | 8175 | 8.3% |
| е | 7528 | 7.7% |
| и | 5574 | 5.7% |
| н | 5527 | 5.6% |
| т | 4935 | 5.0% |
| р | 4330 | 4.4% |
| л | 4227 | 4.3% |
| с | 3976 | 4.0% |
| к | 3307 | 3.4% |
| Other values (62) | 40884 |
Thai
| Value | Count | Frequency (%) |
| า | 4594 | 5.9% |
| น | 4527 | 5.8% |
| อ | 4167 | 5.4% |
| ก | 3661 | 4.7% |
| ่ | 3609 | 4.7% |
| เ | 3608 | 4.7% |
| ร | 3527 | 4.6% |
| ั | 3300 | 4.3% |
| ง | 3050 | 3.9% |
| ้ | 2910 | 3.8% |
| Other values (58) | 40442 |
Hiragana
| Value | Count | Frequency (%) |
| の | 3311 | 13.3% |
| い | 1715 | 6.9% |
| な | 1197 | 4.8% |
| た | 948 | 3.8% |
| に | 895 | 3.6% |
| し | 746 | 3.0% |
| て | 746 | 3.0% |
| と | 740 | 3.0% |
| り | 656 | 2.6% |
| ら | 626 | 2.5% |
| Other values (72) | 13332 |
CJK
| Value | Count | Frequency (%) |
| 的 | 2841 | 2.8% |
| 愛 | 2115 | 2.1% |
| 我 | 1986 | 1.9% |
| 你 | 1870 | 1.8% |
| 人 | 1421 | 1.4% |
| 一 | 1366 | 1.3% |
| 不 | 1176 | 1.1% |
| 情 | 1150 | 1.1% |
| 心 | 949 | 0.9% |
| 曲 | 939 | 0.9% |
| Other values (3430) | 87182 |
Katakana
| Value | Count | Frequency (%) |
| ー | 2825 | 9.7% |
| ン | 2066 | 7.1% |
| ・ | 1407 | 4.8% |
| イ | 1257 | 4.3% |
| ラ | 1084 | 3.7% |
| ス | 1056 | 3.6% |
| ル | 977 | 3.4% |
| ト | 912 | 3.1% |
| リ | 778 | 2.7% |
| ッ | 675 | 2.3% |
| Other values (76) | 16021 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 693 | |
| … | 247 | 16.3% |
| ” | 183 | 12.1% |
| “ | 162 | 10.7% |
| – | 115 | 7.6% |
| ‘ | 32 | 2.1% |
| — | 18 | 1.2% |
| ‐ | 16 | 1.1% |
| ― | 14 | 0.9% |
| „ | 11 | 0.7% |
| Other values (6) | 24 | 1.6% |
Arabic
| Value | Count | Frequency (%) |
| ا | 335 | |
| ل | 242 | 10.9% |
| ي | 220 | 9.9% |
| ن | 129 | 5.8% |
| م | 126 | 5.7% |
| ب | 124 | 5.6% |
| و | 112 | 5.0% |
| ر | 100 | 4.5% |
| ه | 72 | 3.2% |
| ح | 71 | 3.2% |
| Other values (29) | 687 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 101 | |
| ̈ | 30 | 14.0% |
| ̃ | 27 | 12.6% |
| ̆ | 17 | 7.9% |
| ̊ | 15 | 7.0% |
| ̧ | 11 | 5.1% |
| ̂ | 10 | 4.7% |
| ̀ | 3 | 1.4% |
| ̦ | 1 | 0.5% |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 33 | |
| ★ | 24 | |
| ♡ | 7 | 9.5% |
| ♪ | 2 | 2.7% |
| ♂ | 2 | 2.7% |
| ♭ | 1 | 1.4% |
| ♥ | 1 | 1.4% |
| ♬ | 1 | 1.4% |
| ⚭ | 1 | 1.4% |
| ♅ | 1 | 1.4% |
Hangul
| Value | Count | Frequency (%) |
| 이 | 32 | 2.8% |
| 아 | 26 | 2.3% |
| 사 | 26 | 2.3% |
| 랑 | 23 | 2.0% |
| 가 | 22 | 1.9% |
| 하 | 22 | 1.9% |
| 지 | 20 | 1.7% |
| 고 | 19 | 1.7% |
| 리 | 19 | 1.7% |
| 나 | 19 | 1.7% |
| Other values (301) | 920 |
Small Forms
| Value | Count | Frequency (%) |
| ﹕ | 14 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 12 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 11 | |
| Ⅲ | 5 | |
| Ⅰ | 3 | 14.3% |
| Ⅴ | 1 | 4.8% |
| Ⅳ | 1 | 4.8% |
Arrows
| Value | Count | Frequency (%) |
| → | 10 | |
| ↑ | 5 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 7 | |
| ɪ | 2 | 22.2% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 7 |
Jamo
| Value | Count | Frequency (%) |
| ᅡ | 7 | 11.5% |
| ᅥ | 7 | 11.5% |
| ᅵ | 6 | 9.8% |
| ᄂ | 4 | 6.6% |
| ᄋ | 4 | 6.6% |
| ᄆ | 3 | 4.9% |
| ᆯ | 3 | 4.9% |
| ᄎ | 3 | 4.9% |
| ᄀ | 3 | 4.9% |
| ᄇ | 3 | 4.9% |
| Other values (13) | 18 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 7 | |
| ớ | 3 | |
| ồ | 2 | 7.4% |
| ố | 2 | 7.4% |
| ễ | 2 | 7.4% |
| ạ | 2 | 7.4% |
| ổ | 1 | 3.7% |
| ề | 1 | 3.7% |
| ờ | 1 | 3.7% |
| ụ | 1 | 3.7% |
| Other values (5) | 5 |
Georgian
| Value | Count | Frequency (%) |
| ა | 6 | |
| ე | 2 | 11.1% |
| ნ | 2 | 11.1% |
| მ | 2 | 11.1% |
| ი | 2 | 11.1% |
| ტ | 1 | 5.6% |
| დ | 1 | 5.6% |
| ზ | 1 | 5.6% |
| თ | 1 | 5.6% |
Bopomofo
| Value | Count | Frequency (%) |
| ㄚ | 6 | 12.8% |
| ㄘ | 2 | 4.3% |
| ㄨ | 2 | 4.3% |
| ㄞ | 2 | 4.3% |
| ㄟ | 2 | 4.3% |
| ㄧ | 2 | 4.3% |
| ㄎ | 1 | 2.1% |
| ㄍ | 1 | 2.1% |
| ㄈ | 1 | 2.1% |
| ㄐ | 1 | 2.1% |
| Other values (27) | 27 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 5 | |
| − | 1 | 10.0% |
| ≠ | 1 | 10.0% |
| ∆ | 1 | 10.0% |
| ⊰ | 1 | 10.0% |
| ⊱ | 1 | 10.0% |
PUA
| Value | Count | Frequency (%) |
| | 4 | |
| | 2 | |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
| | 1 | 10.0% |
Specials
| Value | Count | Frequency (%) |
| � | 3 |
Geometric Shapes
| Value | Count | Frequency (%) |
| △ | 3 | |
| ○ | 3 | |
| ◑ | 3 | |
| ◐ | 3 | |
| ● | 1 | 7.1% |
| ◯ | 1 | 7.1% |
Lao
| Value | Count | Frequency (%) |
| ນ | 3 | 7.1% |
| ່ | 3 | 7.1% |
| ເ | 2 | 4.8% |
| ມ | 2 | 4.8% |
| ຍ | 2 | 4.8% |
| ອ | 2 | 4.8% |
| ້ | 2 | 4.8% |
| າ | 2 | 4.8% |
| ົ | 2 | 4.8% |
| ັ | 2 | 4.8% |
| Other values (18) | 20 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˈ | 2 | |
| ˙ | 1 | |
| ʻ | 1 | |
| ˇ | 1 | |
| ˋ | 1 |
CJK Ext B
| Value | Count | Frequency (%) |
| 𠱁 | 2 |
Tibetan
| Value | Count | Frequency (%) |
| ྷ | 1 | |
| ུ | 1 | |
| ཆ | 1 | |
| ེ | 1 | |
| མ | 1 | |
| ལ | 1 | |
| ཨ | 1 | |
| ྨ | 1 | |
| ར | 1 | |
| ཀ | 1 | |
| Other values (2) | 2 |
Sup Math Operators
| Value | Count | Frequency (%) |
| ⫸ | 1 | |
| ⫷ | 1 | |
| ⨳ | 1 |
VS
| Value | Count | Frequency (%) |
| ︎ | 1 |
Dingbats
| Value | Count | Frequency (%) |
| ❈ | 1 |
popularity
Real number (ℝ)
| Distinct | 101 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.570053 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 44690 |
| Zeros (%) | 7.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 13 |
| median | 27 |
| Q3 | 41 |
| 95-th percentile | 59 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 18.370642 |
|---|---|
| Coefficient of variation (CV) | 0.66632598 |
| Kurtosis | -0.63280211 |
| Mean | 27.570053 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 0.278697 |
| Sum | 16174578 |
| Variance | 337.4805 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44690 | 7.6% |
| 35 | 12231 | 2.1% |
| 23 | 12139 | 2.1% |
| 1 | 12024 | 2.0% |
| 36 | 11879 | 2.0% |
| 34 | 11328 | 1.9% |
| 27 | 11292 | 1.9% |
| 22 | 11206 | 1.9% |
| 33 | 11174 | 1.9% |
| 24 | 11148 | 1.9% |
| Other values (91) | 437561 |
| Value | Count | Frequency (%) |
| 0 | 44690 | |
| 1 | 12024 | 2.0% |
| 2 | 9639 | 1.6% |
| 3 | 8154 | 1.4% |
| 4 | 7733 | 1.3% |
| 5 | 7730 | 1.3% |
| 6 | 7659 | 1.3% |
| 7 | 7726 | 1.3% |
| 8 | 7988 | 1.4% |
| 9 | 8265 | 1.4% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 99 | 1 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 2 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 6 | |
| 93 | 2 | < 0.1% |
| 92 | 10 | |
| 91 | 11 |
duration_ms
Real number (ℝ)
| Distinct | 123122 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 230051.17 |
| Minimum | 3344 |
|---|---|
| Maximum | 5621218 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 3344 |
|---|---|
| 5-th percentile | 97307 |
| Q1 | 175093 |
| median | 214893 |
| Q3 | 263867 |
| 95-th percentile | 382333 |
| Maximum | 5621218 |
| Range | 5617874 |
| Interquartile range (IQR) | 88774 |
Descriptive statistics
| Standard deviation | 126526.09 |
|---|---|
| Coefficient of variation (CV) | 0.54999107 |
| Kurtosis | 241.06655 |
| Mean | 230051.17 |
| Median Absolute Deviation (MAD) | 43838.5 |
| Skewness | 10.325622 |
| Sum | 1.3496458 × 1011 |
| Variance | 1.6008851 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 240000 | 215 | < 0.1% |
| 192000 | 201 | < 0.1% |
| 180000 | 199 | < 0.1% |
| 216000 | 184 | < 0.1% |
| 210000 | 171 | < 0.1% |
| 184000 | 166 | < 0.1% |
| 200000 | 166 | < 0.1% |
| 208000 | 162 | < 0.1% |
| 228000 | 152 | < 0.1% |
| 198000 | 151 | < 0.1% |
| Other values (123112) | 584905 |
| Value | Count | Frequency (%) |
| 3344 | 4 | |
| 4000 | 8 | |
| 4937 | 1 | < 0.1% |
| 5108 | 1 | < 0.1% |
| 5991 | 1 | < 0.1% |
| 6360 | 1 | < 0.1% |
| 6362 | 1 | < 0.1% |
| 6373 | 3 | < 0.1% |
| 7523 | 1 | < 0.1% |
| 8594 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5621218 | 1 | |
| 5403500 | 1 | |
| 5042185 | 1 | |
| 4995083 | 1 | |
| 4864333 | 1 | |
| 4800118 | 1 | |
| 4797258 | 1 | |
| 4792587 | 1 | |
| 4786672 | 1 | |
| 4775518 | 1 |
explicit
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 0 | |
|---|---|
| 1 | 25864 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 586672 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 586672 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 586672 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 586672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 560808 | |
| 1 | 25864 | 4.4% |
artists
Categorical
| Distinct | 114030 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| ['Die drei ???'] | 3856 |
|---|---|
| ['TKKG Retro-Archiv'] | 2006 |
| ['Benjamin Blümchen'] | 1503 |
| ['Bibi Blocksberg'] | 1472 |
| ['Lata Mangeshkar'] | 1373 |
| Other values (114025) |
Length
| Max length | 934 |
|---|---|
| Median length | 492 |
| Mean length | 21.612956 |
| Min length | 4 |
Characters and Unicode
| Total characters | 12679716 |
|---|---|
| Distinct characters | 2156 |
| Distinct categories | 20 ? |
| Distinct scripts | 13 ? |
| Distinct blocks | 22 ? |
Unique
| Unique | 66232 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | ['Uli'] |
|---|---|
| 2nd row | ['Fernando Pessoa'] |
| 3rd row | ['Ignacio Corsini'] |
| 4th row | ['Ignacio Corsini'] |
| 5th row | ['Dick Haymes'] |
Common Values
| Value | Count | Frequency (%) |
| ['Die drei ???'] | 3856 | 0.7% |
| ['TKKG Retro-Archiv'] | 2006 | 0.3% |
| ['Benjamin Blümchen'] | 1503 | 0.3% |
| ['Bibi Blocksberg'] | 1472 | 0.3% |
| ['Lata Mangeshkar'] | 1373 | 0.2% |
| ['Bibi und Tina'] | 927 | 0.2% |
| ['Tintin', 'Tomas Bolme', 'Bert-Åke Varg'] | 905 | 0.2% |
| ['Francisco Canaro'] | 891 | 0.2% |
| ['Ella Fitzgerald'] | 870 | 0.1% |
| ['Tadeusz Dolega Mostowicz'] | 838 | 0.1% |
| Other values (114020) | 572031 |
Length
| Value | Count | Frequency (%) |
| the | 29997 | 1.9% |
| 22820 | 1.5% | |
| orchestra | 12441 | 0.8% |
| de | 10079 | 0.6% |
| los | 9267 | 0.6% |
| die | 5267 | 0.3% |
| la | 4812 | 0.3% |
| del | 4260 | 0.3% |
| john | 4178 | 0.3% |
| his | 4157 | 0.3% |
| Other values (80592) | 1451219 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 1507306 | 11.9% |
| 971828 | 7.7% | |
| a | 885658 | 7.0% |
| e | 763743 | 6.0% |
| i | 607358 | 4.8% |
| ] | 586740 | 4.6% |
| [ | 586740 | 4.6% |
| r | 583100 | 4.6% |
| n | 578674 | 4.6% |
| o | 557799 | 4.4% |
| Other values (2146) | 5050770 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7049614 | |
| Other Punctuation | 1762841 | 13.9% |
| Uppercase Letter | 1618636 | 12.8% |
| Space Separator | 971828 | 7.7% |
| Close Punctuation | 587847 | 4.6% |
| Open Punctuation | 587845 | 4.6% |
| Other Letter | 58839 | 0.5% |
| Decimal Number | 19835 | 0.2% |
| Dash Punctuation | 14763 | 0.1% |
| Nonspacing Mark | 5568 | < 0.1% |
| Other values (10) | 2100 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ร | 1888 | 3.2% |
| น | 1619 | 2.8% |
| อ | 1187 | 2.0% |
| า | 1120 | 1.9% |
| ว | 1083 | 1.8% |
| ม | 955 | 1.6% |
| เ | 917 | 1.6% |
| ส | 915 | 1.6% |
| ท | 736 | 1.3% |
| 李 | 731 | 1.2% |
| Other values (1740) | 47688 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 885658 | |
| e | 763743 | |
| i | 607358 | 8.6% |
| r | 583100 | 8.3% |
| n | 578674 | 8.2% |
| o | 557799 | 7.9% |
| l | 397378 | 5.6% |
| s | 395691 | 5.6% |
| t | 328892 | 4.7% |
| h | 256752 | 3.6% |
| Other values (169) | 1694569 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 138185 | 8.5% |
| M | 122570 | 7.6% |
| B | 114524 | 7.1% |
| C | 102008 | 6.3% |
| A | 100915 | 6.2% |
| T | 98618 | 6.1% |
| D | 83448 | 5.2% |
| L | 82798 | 5.1% |
| R | 79486 | 4.9% |
| P | 76285 | 4.7% |
| Other values (116) | 619799 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1507306 | |
| , | 173449 | 9.8% |
| . | 31571 | 1.8% |
| & | 17636 | 1.0% |
| " | 17113 | 1.0% |
| ? | 11659 | 0.7% |
| / | 1513 | 0.1% |
| ! | 1455 | 0.1% |
| : | 250 | < 0.1% |
| ・ | 233 | < 0.1% |
| Other values (16) | 656 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ิ | 1404 | |
| ์ | 1062 | |
| ั | 769 | |
| ี | 573 | |
| ุ | 458 | 8.2% |
| ่ | 391 | 7.0% |
| ้ | 280 | 5.0% |
| ู | 262 | 4.7% |
| ๊ | 141 | 2.5% |
| ็ | 98 | 1.8% |
| Other values (4) | 130 | 2.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3345 | |
| 2 | 3192 | |
| 0 | 2836 | |
| 4 | 1944 | |
| 3 | 1825 | |
| 5 | 1705 | |
| 7 | 1357 | |
| 9 | 1310 | 6.6% |
| 8 | 1211 | 6.1% |
| 6 | 1110 | 5.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 186 | |
| | | 25 | 10.6% |
| = | 12 | 5.1% |
| × | 3 | 1.3% |
| ~ | 3 | 1.3% |
| > | 2 | 0.8% |
| √ | 2 | 0.8% |
| ⇔ | 1 | 0.4% |
| < | 1 | 0.4% |
| ∀ | 1 | 0.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 69 | |
| ` | 21 | 21.2% |
| ^ | 4 | 4.0% |
| ¨ | 3 | 3.0% |
| ¯ | 1 | 1.0% |
| ゛ | 1 | 1.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ☆ | 21 | |
| ° | 10 | |
| № | 5 | 11.6% |
| ® | 4 | 9.3% |
| ★ | 2 | 4.7% |
| © | 1 | 2.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 456 | |
| ¥ | 11 | 2.3% |
| € | 3 | 0.6% |
| ¢ | 1 | 0.2% |
| £ | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 586740 | |
| ) | 1090 | 0.2% |
| ] | 10 | < 0.1% |
| ) | 7 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 586740 | |
| ( | 1087 | 0.2% |
| [ | 10 | < 0.1% |
| ( | 8 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14649 | |
| – | 110 | 0.7% |
| 〜 | 4 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 156 | |
| » | 56 | 22.9% |
| ” | 33 | 13.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 55 | |
| “ | 10 | 14.7% |
| ‘ | 3 | 4.4% |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 869 | |
| 々 | 7 | 0.8% |
Other Number
| Value | Count | Frequency (%) |
| ² | 3 | |
| ³ | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 971828 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 56 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8626881 | |
| Common | 3947045 | |
| Cyrillic | 35174 | 0.3% |
| Han | 30733 | 0.2% |
| Thai | 24963 | 0.2% |
| Greek | 6209 | < 0.1% |
| Katakana | 5238 | < 0.1% |
| Hebrew | 1830 | < 0.1% |
| Hiragana | 1050 | < 0.1% |
| Arabic | 467 | < 0.1% |
| Other values (3) | 126 | < 0.1% |
Most frequent character per script
Han
| Value | Count | Frequency (%) |
| 李 | 731 | 2.4% |
| 林 | 505 | 1.6% |
| 樂 | 403 | 1.3% |
| 淑 | 396 | 1.3% |
| 吳 | 392 | 1.3% |
| 蔡 | 387 | 1.3% |
| 陳 | 376 | 1.2% |
| 雲 | 357 | 1.2% |
| 許 | 325 | 1.1% |
| 爾 | 314 | 1.0% |
| Other values (1430) | 26547 |
Latin
| Value | Count | Frequency (%) |
| a | 885658 | 10.3% |
| e | 763743 | 8.9% |
| i | 607358 | 7.0% |
| r | 583100 | 6.8% |
| n | 578674 | 6.7% |
| o | 557799 | 6.5% |
| l | 397378 | 4.6% |
| s | 395691 | 4.6% |
| t | 328892 | 3.8% |
| h | 256752 | 3.0% |
| Other values (166) | 3271836 |
Common
| Value | Count | Frequency (%) |
| ' | 1507306 | |
| 971828 | ||
| ] | 586740 | 14.9% |
| [ | 586740 | 14.9% |
| , | 173449 | 4.4% |
| . | 31571 | 0.8% |
| & | 17636 | 0.4% |
| " | 17113 | 0.4% |
| - | 14649 | 0.4% |
| ? | 11659 | 0.3% |
| Other values (75) | 28354 | 0.7% |
Katakana
| Value | Count | Frequency (%) |
| ル | 484 | 9.2% |
| ン | 447 | 8.5% |
| ス | 422 | 8.1% |
| オ | 385 | 7.4% |
| ズ | 341 | 6.5% |
| タ | 334 | 6.4% |
| サ | 332 | 6.3% |
| ザ | 306 | 5.8% |
| シ | 127 | 2.4% |
| イ | 104 | 2.0% |
| Other values (70) | 1956 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 3565 | 10.1% |
| и | 2858 | 8.1% |
| о | 2645 | 7.5% |
| н | 2624 | 7.5% |
| е | 2133 | 6.1% |
| р | 2083 | 5.9% |
| в | 1795 | 5.1% |
| л | 1487 | 4.2% |
| с | 1284 | 3.7% |
| т | 1263 | 3.6% |
| Other values (58) | 13437 |
Hiragana
| Value | Count | Frequency (%) |
| み | 62 | 5.9% |
| ん | 61 | 5.8% |
| と | 58 | 5.5% |
| え | 58 | 5.5% |
| い | 57 | 5.4% |
| の | 39 | 3.7% |
| は | 39 | 3.7% |
| ま | 35 | 3.3% |
| さ | 33 | 3.1% |
| ど | 32 | 3.0% |
| Other values (54) | 576 |
Thai
| Value | Count | Frequency (%) |
| ร | 1888 | 7.6% |
| น | 1619 | 6.5% |
| ิ | 1404 | 5.6% |
| อ | 1187 | 4.8% |
| า | 1120 | 4.5% |
| ว | 1083 | 4.3% |
| ์ | 1062 | 4.3% |
| ม | 955 | 3.8% |
| เ | 917 | 3.7% |
| ส | 915 | 3.7% |
| Other values (52) | 12813 |
Hangul
| Value | Count | Frequency (%) |
| 이 | 8 | 7.3% |
| 정 | 6 | 5.5% |
| 지 | 5 | 4.6% |
| 유 | 3 | 2.8% |
| 김 | 3 | 2.8% |
| 현 | 3 | 2.8% |
| 성 | 3 | 2.8% |
| 미 | 3 | 2.8% |
| 나 | 3 | 2.8% |
| 수 | 3 | 2.8% |
| Other values (51) | 69 |
Greek
| Value | Count | Frequency (%) |
| ς | 562 | 9.1% |
| α | 514 | 8.3% |
| ο | 418 | 6.7% |
| ρ | 362 | 5.8% |
| τ | 361 | 5.8% |
| η | 310 | 5.0% |
| ν | 250 | 4.0% |
| ι | 233 | 3.8% |
| λ | 213 | 3.4% |
| κ | 213 | 3.4% |
| Other values (44) | 2773 |
Hebrew
| Value | Count | Frequency (%) |
| י | 314 | |
| ו | 195 | |
| ר | 144 | 7.9% |
| ה | 123 | 6.7% |
| ב | 112 | 6.1% |
| נ | 107 | 5.8% |
| א | 107 | 5.8% |
| ל | 104 | 5.7% |
| ן | 82 | 4.5% |
| מ | 57 | 3.1% |
| Other values (18) | 485 |
Arabic
| Value | Count | Frequency (%) |
| م | 51 | |
| ي | 49 | 10.5% |
| ا | 37 | 7.9% |
| د | 34 | 7.3% |
| ل | 34 | 7.3% |
| ز | 33 | 7.1% |
| ف | 31 | 6.6% |
| ر | 26 | 5.6% |
| ح | 25 | 5.4% |
| و | 22 | 4.7% |
| Other values (18) | 125 |
Georgian
| Value | Count | Frequency (%) |
| ა | 4 | |
| ი | 3 | |
| ე | 1 | 7.1% |
| ძ | 1 | 7.1% |
| მ | 1 | 7.1% |
| ს | 1 | 7.1% |
| დ | 1 | 7.1% |
| ზ | 1 | 7.1% |
| ნ | 1 | 7.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12482277 | |
| None | 96402 | 0.8% |
| Cyrillic | 35174 | 0.3% |
| CJK | 30717 | 0.2% |
| Thai | 24963 | 0.2% |
| Katakana | 6340 | 0.1% |
| Hebrew | 1830 | < 0.1% |
| Hiragana | 1051 | < 0.1% |
| Arabic | 467 | < 0.1% |
| Punctuation | 316 | < 0.1% |
| Other values (12) | 179 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 1507306 | 12.1% |
| 971828 | 7.8% | |
| a | 885658 | 7.1% |
| e | 763743 | 6.1% |
| i | 607358 | 4.9% |
| ] | 586740 | 4.7% |
| [ | 586740 | 4.7% |
| r | 583100 | 4.7% |
| n | 578674 | 4.6% |
| o | 557799 | 4.5% |
| Other values (83) | 4853331 |
None
| Value | Count | Frequency (%) |
| é | 14064 | 14.6% |
| á | 9701 | 10.1% |
| ü | 7827 | 8.1% |
| ó | 6875 | 7.1% |
| í | 6494 | 6.7% |
| ö | 5330 | 5.5% |
| ı | 2911 | 3.0% |
| ä | 2518 | 2.6% |
| ç | 2191 | 2.3% |
| ú | 2179 | 2.3% |
| Other values (192) | 36312 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 3565 | 10.1% |
| и | 2858 | 8.1% |
| о | 2645 | 7.5% |
| н | 2624 | 7.5% |
| е | 2133 | 6.1% |
| р | 2083 | 5.9% |
| в | 1795 | 5.1% |
| л | 1487 | 4.2% |
| с | 1284 | 3.7% |
| т | 1263 | 3.6% |
| Other values (58) | 13437 |
Thai
| Value | Count | Frequency (%) |
| ร | 1888 | 7.6% |
| น | 1619 | 6.5% |
| ิ | 1404 | 5.6% |
| อ | 1187 | 4.8% |
| า | 1120 | 4.5% |
| ว | 1083 | 4.3% |
| ์ | 1062 | 4.3% |
| ม | 955 | 3.8% |
| เ | 917 | 3.7% |
| ส | 915 | 3.7% |
| Other values (52) | 12813 |
Katakana
| Value | Count | Frequency (%) |
| ー | 869 | 13.7% |
| ル | 484 | 7.6% |
| ン | 447 | 7.1% |
| ス | 422 | 6.7% |
| オ | 385 | 6.1% |
| ズ | 341 | 5.4% |
| タ | 334 | 5.3% |
| サ | 332 | 5.2% |
| ザ | 306 | 4.8% |
| ・ | 233 | 3.7% |
| Other values (72) | 2187 |
CJK
| Value | Count | Frequency (%) |
| 李 | 731 | 2.4% |
| 林 | 505 | 1.6% |
| 樂 | 403 | 1.3% |
| 淑 | 396 | 1.3% |
| 吳 | 392 | 1.3% |
| 蔡 | 387 | 1.3% |
| 陳 | 376 | 1.2% |
| 雲 | 357 | 1.2% |
| 許 | 325 | 1.1% |
| 爾 | 314 | 1.0% |
| Other values (1427) | 26531 |
Hebrew
| Value | Count | Frequency (%) |
| י | 314 | |
| ו | 195 | |
| ר | 144 | 7.9% |
| ה | 123 | 6.7% |
| ב | 112 | 6.1% |
| נ | 107 | 5.8% |
| א | 107 | 5.8% |
| ל | 104 | 5.7% |
| ן | 82 | 4.5% |
| מ | 57 | 3.1% |
| Other values (18) | 485 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 156 | |
| – | 110 | |
| ” | 33 | 10.4% |
| “ | 10 | 3.2% |
| ‘ | 3 | 0.9% |
| • | 3 | 0.9% |
| † | 1 | 0.3% |
Hiragana
| Value | Count | Frequency (%) |
| み | 62 | 5.9% |
| ん | 61 | 5.8% |
| と | 58 | 5.5% |
| え | 58 | 5.5% |
| い | 57 | 5.4% |
| の | 39 | 3.7% |
| は | 39 | 3.7% |
| ま | 35 | 3.3% |
| さ | 33 | 3.1% |
| ど | 32 | 3.0% |
| Other values (55) | 577 |
Arabic
| Value | Count | Frequency (%) |
| م | 51 | |
| ي | 49 | 10.5% |
| ا | 37 | 7.9% |
| د | 34 | 7.3% |
| ل | 34 | 7.3% |
| ز | 33 | 7.1% |
| ف | 31 | 6.6% |
| ر | 26 | 5.6% |
| ح | 25 | 5.4% |
| و | 22 | 4.7% |
| Other values (18) | 125 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 21 | |
| ★ | 2 | 8.7% |
Hangul
| Value | Count | Frequency (%) |
| 이 | 8 | 7.3% |
| 정 | 6 | 5.5% |
| 지 | 5 | 4.6% |
| 유 | 3 | 2.8% |
| 김 | 3 | 2.8% |
| 현 | 3 | 2.8% |
| 성 | 3 | 2.8% |
| 미 | 3 | 2.8% |
| 나 | 3 | 2.8% |
| 수 | 3 | 2.8% |
| Other values (51) | 69 |
CJK Compat Ideographs
| Value | Count | Frequency (%) |
| 﨑 | 7 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 6 | |
| ṣ | 1 | 12.5% |
| ữ | 1 | 12.5% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 5 |
Georgian
| Value | Count | Frequency (%) |
| ა | 4 | |
| ი | 3 | |
| ე | 1 | 7.1% |
| ძ | 1 | 7.1% |
| მ | 1 | 7.1% |
| ს | 1 | 7.1% |
| დ | 1 | 7.1% |
| ზ | 1 | 7.1% |
| ნ | 1 | 7.1% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 3 |
CJK Ext B
| Value | Count | Frequency (%) |
| 𤒹 | 2 |
Math Operators
| Value | Count | Frequency (%) |
| √ | 2 | |
| ∀ | 1 |
Arrows
| Value | Count | Frequency (%) |
| ⇔ | 1 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 1 |
id_artists
Categorical
| Distinct | 115062 |
|---|---|
| Distinct (%) | 19.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| ['3meJIgRw7YleJrmbpbJK6S'] | 3856 |
|---|---|
| ['0i38tQX5j4gZ0KS3eCMoIl'] | 2006 |
| ['1l6d0RIxTL3JytlLGvWzYe'] | 1503 |
| ['3t2iKODSDyzoDJw7AsD99u'] | 1472 |
| ['61JrslREXq98hurYL2hYoc'] | 1373 |
| Other values (115057) |
Length
| Max length | 1508 |
|---|---|
| Median length | 26 |
| Mean length | 33.556093 |
| Min length | 26 |
Characters and Unicode
| Total characters | 19686420 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 67145 ? |
|---|---|
| Unique (%) | 11.4% |
Sample
| 1st row | ['45tIt06XoI0Iio4LBEVpls'] |
|---|---|
| 2nd row | ['14jtPCOoNZwquk5wd9DxrY'] |
| 3rd row | ['5LiOoJbxVSAMkBS2fUm3X2'] |
| 4th row | ['5LiOoJbxVSAMkBS2fUm3X2'] |
| 5th row | ['3BiJGZsyX9sJchTqcSA7Su'] |
Common Values
| Value | Count | Frequency (%) |
| ['3meJIgRw7YleJrmbpbJK6S'] | 3856 | 0.7% |
| ['0i38tQX5j4gZ0KS3eCMoIl'] | 2006 | 0.3% |
| ['1l6d0RIxTL3JytlLGvWzYe'] | 1503 | 0.3% |
| ['3t2iKODSDyzoDJw7AsD99u'] | 1472 | 0.3% |
| ['61JrslREXq98hurYL2hYoc'] | 1373 | 0.2% |
| ['2x8vG4f0HYXzMEo3xNsoiI'] | 927 | 0.2% |
| ['6aMD1KAa5i3Myy61cR8FiW', '7HjbJ8V87zrxkSzL1KieQk', '71ADe4Zg9UyE8WQEHbJSXM'] | 905 | 0.2% |
| ['2maQMqxNnlRrBrS1oAsrX9'] | 891 | 0.2% |
| ['5V0MlUE1Bft0mbLlND7FJz'] | 870 | 0.1% |
| ['4eeMulNeqpZGBxybCxZOdC'] | 838 | 0.1% |
| Other values (115052) | 572031 |
Length
| Value | Count | Frequency (%) |
| 3mejigrw7ylejrmbpbjk6s | 3856 | 0.5% |
| 61jrslrexq98huryl2hyoc | 2605 | 0.3% |
| 5aiqb5nvvvmfsvsdexz408 | 2020 | 0.3% |
| 2maqmqxnnlrrbrs1oasrx9 | 2010 | 0.3% |
| 0i38tqx5j4gz0ks3ecmoil | 2006 | 0.3% |
| 4njhfmfw43rlbljqvxdurs | 1821 | 0.2% |
| 0gxdpqwyndodn7fb0rdn8j | 1553 | 0.2% |
| 1l6d0rixtl3jytllgvwzye | 1503 | 0.2% |
| 3t2ikodsdyzodjw7asd99u | 1472 | 0.2% |
| 2woqmjp9tyabvthdosotus | 1253 | 0.2% |
| Other values (98494) | 737071 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 1514340 | 7.7% |
| [ | 586672 | 3.0% |
| ] | 586672 | 3.0% |
| 0 | 365662 | 1.9% |
| 4 | 359536 | 1.8% |
| 2 | 358258 | 1.8% |
| 5 | 358255 | 1.8% |
| 3 | 353126 | 1.8% |
| 1 | 350394 | 1.8% |
| 6 | 344241 | 1.7% |
| Other values (57) | 14509264 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6705473 | |
| Uppercase Letter | 6611755 | |
| Decimal Number | 3340512 | |
| Other Punctuation | 1684838 | 8.6% |
| Open Punctuation | 586672 | 3.0% |
| Close Punctuation | 586672 | 3.0% |
| Space Separator | 170498 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 275758 | 4.1% |
| l | 267655 | 4.0% |
| m | 266604 | 4.0% |
| b | 264203 | 3.9% |
| y | 263309 | 3.9% |
| x | 263030 | 3.9% |
| r | 262848 | 3.9% |
| s | 262422 | 3.9% |
| q | 261029 | 3.9% |
| o | 260528 | 3.9% |
| Other values (16) | 4058087 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 267989 | 4.1% |
| D | 267159 | 4.0% |
| J | 266786 | 4.0% |
| C | 261777 | 4.0% |
| X | 261442 | 4.0% |
| R | 259970 | 3.9% |
| Y | 259593 | 3.9% |
| O | 257776 | 3.9% |
| L | 254702 | 3.9% |
| I | 254611 | 3.9% |
| Other values (16) | 3999950 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 365662 | |
| 4 | 359536 | |
| 2 | 358258 | |
| 5 | 358255 | |
| 3 | 353126 | |
| 1 | 350394 | |
| 6 | 344241 | |
| 7 | 335002 | |
| 8 | 261694 | |
| 9 | 254344 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1514340 | |
| , | 170498 | 10.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 586672 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 586672 |
Space Separator
| Value | Count | Frequency (%) |
| 170498 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13317228 | |
| Common | 6369192 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 275758 | 2.1% |
| S | 267989 | 2.0% |
| l | 267655 | 2.0% |
| D | 267159 | 2.0% |
| J | 266786 | 2.0% |
| m | 266604 | 2.0% |
| b | 264203 | 2.0% |
| y | 263309 | 2.0% |
| x | 263030 | 2.0% |
| r | 262848 | 2.0% |
| Other values (42) | 10651887 |
Common
| Value | Count | Frequency (%) |
| ' | 1514340 | |
| [ | 586672 | 9.2% |
| ] | 586672 | 9.2% |
| 0 | 365662 | 5.7% |
| 4 | 359536 | 5.6% |
| 2 | 358258 | 5.6% |
| 5 | 358255 | 5.6% |
| 3 | 353126 | 5.5% |
| 1 | 350394 | 5.5% |
| 6 | 344241 | 5.4% |
| Other values (5) | 1192036 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19686420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 1514340 | 7.7% |
| [ | 586672 | 3.0% |
| ] | 586672 | 3.0% |
| 0 | 365662 | 1.9% |
| 4 | 359536 | 1.8% |
| 2 | 358258 | 1.8% |
| 5 | 358255 | 1.8% |
| 3 | 353126 | 1.8% |
| 1 | 350394 | 1.8% |
| 6 | 344241 | 1.7% |
| Other values (57) | 14509264 |
release_date
Categorical
| Distinct | 19700 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 1998-01-01 | 2893 |
|---|---|
| 1997-01-01 | 2892 |
| 1995 | 2871 |
| 1997 | 2811 |
| 1996 | 2776 |
| Other values (19695) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.5933537 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5041480 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2176 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1922-02-22 |
|---|---|
| 2nd row | 1922-06-01 |
| 3rd row | 1922-03-21 |
| 4th row | 1922-03-21 |
| 5th row | 1922 |
Common Values
| Value | Count | Frequency (%) |
| 1998-01-01 | 2893 | 0.5% |
| 1997-01-01 | 2892 | 0.5% |
| 1995 | 2871 | 0.5% |
| 1997 | 2811 | 0.5% |
| 1996 | 2776 | 0.5% |
| 1990-01-01 | 2752 | 0.5% |
| 1998 | 2726 | 0.5% |
| 1996-01-01 | 2705 | 0.5% |
| 1994 | 2611 | 0.4% |
| 1995-01-01 | 2575 | 0.4% |
| Other values (19690) | 559060 |
Length
| Value | Count | Frequency (%) |
| 1998-01-01 | 2893 | 0.5% |
| 1997-01-01 | 2892 | 0.5% |
| 1995 | 2871 | 0.5% |
| 1997 | 2811 | 0.5% |
| 1996 | 2776 | 0.5% |
| 1990-01-01 | 2752 | 0.5% |
| 1998 | 2726 | 0.5% |
| 1996-01-01 | 2705 | 0.5% |
| 1994 | 2611 | 0.4% |
| 1995-01-01 | 2575 | 0.4% |
| Other values (19690) | 559060 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1100398 | |
| 0 | 1000347 | |
| - | 898264 | |
| 9 | 604970 | |
| 2 | 479776 | |
| 8 | 194866 | 3.9% |
| 7 | 173992 | 3.5% |
| 6 | 162608 | 3.2% |
| 5 | 153465 | 3.0% |
| 3 | 143262 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4143216 | |
| Dash Punctuation | 898264 | 17.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1100398 | |
| 0 | 1000347 | |
| 9 | 604970 | |
| 2 | 479776 | |
| 8 | 194866 | 4.7% |
| 7 | 173992 | 4.2% |
| 6 | 162608 | 3.9% |
| 5 | 153465 | 3.7% |
| 3 | 143262 | 3.5% |
| 4 | 129532 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 898264 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5041480 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1100398 | |
| 0 | 1000347 | |
| - | 898264 | |
| 9 | 604970 | |
| 2 | 479776 | |
| 8 | 194866 | 3.9% |
| 7 | 173992 | 3.5% |
| 6 | 162608 | 3.2% |
| 5 | 153465 | 3.0% |
| 3 | 143262 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5041480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1100398 | |
| 0 | 1000347 | |
| - | 898264 | |
| 9 | 604970 | |
| 2 | 479776 | |
| 8 | 194866 | 3.9% |
| 7 | 173992 | 3.5% |
| 6 | 162608 | 3.2% |
| 5 | 153465 | 3.0% |
| 3 | 143262 | 2.8% |
danceability
Real number (ℝ)
| Distinct | 1285 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.56359382 |
| Minimum | 0 |
|---|---|
| Maximum | 0.991 |
| Zeros | 328 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.267 |
| Q1 | 0.453 |
| median | 0.577 |
| Q3 | 0.686 |
| 95-th percentile | 0.815 |
| Maximum | 0.991 |
| Range | 0.991 |
| Interquartile range (IQR) | 0.233 |
Descriptive statistics
| Standard deviation | 0.16610265 |
|---|---|
| Coefficient of variation (CV) | 0.2947205 |
| Kurtosis | -0.27402096 |
| Mean | 0.56359382 |
| Median Absolute Deviation (MAD) | 0.115 |
| Skewness | -0.33082544 |
| Sum | 330644.71 |
| Variance | 0.027590092 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.637 | 1483 | 0.3% |
| 0.629 | 1453 | 0.2% |
| 0.602 | 1446 | 0.2% |
| 0.595 | 1444 | 0.2% |
| 0.616 | 1442 | 0.2% |
| 0.63 | 1440 | 0.2% |
| 0.62 | 1437 | 0.2% |
| 0.632 | 1433 | 0.2% |
| 0.607 | 1431 | 0.2% |
| 0.565 | 1429 | 0.2% |
| Other values (1275) | 572234 |
| Value | Count | Frequency (%) |
| 0 | 328 | |
| 0.0532 | 1 | < 0.1% |
| 0.0546 | 1 | < 0.1% |
| 0.0559 | 2 | < 0.1% |
| 0.0562 | 1 | < 0.1% |
| 0.0569 | 2 | < 0.1% |
| 0.057 | 1 | < 0.1% |
| 0.0572 | 1 | < 0.1% |
| 0.0574 | 2 | < 0.1% |
| 0.0579 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.991 | 1 | < 0.1% |
| 0.988 | 3 | |
| 0.987 | 2 | < 0.1% |
| 0.986 | 3 | |
| 0.985 | 6 | |
| 0.984 | 5 | |
| 0.983 | 2 | < 0.1% |
| 0.982 | 4 | |
| 0.981 | 1 | < 0.1% |
| 0.98 | 7 |
energy
Real number (ℝ)
| Distinct | 2571 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.54203599 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 33 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.12 |
| Q1 | 0.343 |
| median | 0.549 |
| Q3 | 0.748 |
| 95-th percentile | 0.931 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.405 |
Descriptive statistics
| Standard deviation | 0.25192294 |
|---|---|
| Coefficient of variation (CV) | 0.46477161 |
| Kurtosis | -0.96379157 |
| Mean | 0.54203599 |
| Median Absolute Deviation (MAD) | 0.202 |
| Skewness | -0.13138282 |
| Sum | 317997.34 |
| Variance | 0.063465168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.526 | 847 | 0.1% |
| 0.538 | 846 | 0.1% |
| 0.716 | 836 | 0.1% |
| 0.448 | 835 | 0.1% |
| 0.497 | 832 | 0.1% |
| 0.534 | 826 | 0.1% |
| 0.53 | 826 | 0.1% |
| 0.666 | 823 | 0.1% |
| 0.726 | 821 | 0.1% |
| 0.499 | 820 | 0.1% |
| Other values (2561) | 578360 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 1.97 × 10-5 | 2 | < 0.1% |
| 1.98 × 10-5 | 1 | < 0.1% |
| 1.99 × 10-5 | 2 | < 0.1% |
| 2 × 10-5 | 3 | < 0.1% |
| 2.01 × 10-5 | 10 | < 0.1% |
| 2.02 × 10-5 | 12 | < 0.1% |
| 2.03 × 10-5 | 36 | |
| 2.8 × 10-5 | 1 | < 0.1% |
| 3.05 × 10-5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 64 | < 0.1% |
| 0.999 | 217 | |
| 0.998 | 223 | |
| 0.997 | 245 | |
| 0.996 | 255 | |
| 0.995 | 312 | |
| 0.994 | 267 | |
| 0.993 | 256 | |
| 0.992 | 262 | |
| 0.991 | 311 |
key
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2216025 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 74950 |
| Zeros (%) | 12.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.5194231 |
|---|---|
| Coefficient of variation (CV) | 0.67401207 |
| Kurtosis | -1.2659395 |
| Mean | 5.2216025 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0013936386 |
| Sum | 3063368 |
| Variance | 12.386339 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 74950 | |
| 7 | 73779 | |
| 2 | 66552 | |
| 9 | 65128 | |
| 5 | 53614 | |
| 4 | 48220 | |
| 1 | 41736 | |
| 11 | 39132 | |
| 10 | 37710 | |
| 8 | 33460 | |
| Other values (2) | 52391 |
| Value | Count | Frequency (%) |
| 0 | 74950 | |
| 1 | 41736 | |
| 2 | 66552 | |
| 3 | 21535 | 3.7% |
| 4 | 48220 | |
| 5 | 53614 | |
| 6 | 30856 | |
| 7 | 73779 | |
| 8 | 33460 | |
| 9 | 65128 |
| Value | Count | Frequency (%) |
| 11 | 39132 | |
| 10 | 37710 | |
| 9 | 65128 | |
| 8 | 33460 | |
| 7 | 73779 | |
| 6 | 30856 | |
| 5 | 53614 | |
| 4 | 48220 | |
| 3 | 21535 | 3.7% |
| 2 | 66552 |
loudness
Real number (ℝ)
| Distinct | 29196 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -10.206067 |
| Minimum | -60 |
|---|---|
| Maximum | 5.376 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 586453 |
| Negative (%) | > 99.9% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | -60 |
|---|---|
| 5-th percentile | -19.843 |
| Q1 | -12.891 |
| median | -9.243 |
| Q3 | -6.482 |
| 95-th percentile | -3.91 |
| Maximum | 5.376 |
| Range | 65.376 |
| Interquartile range (IQR) | 6.409 |
Descriptive statistics
| Standard deviation | 5.0893279 |
|---|---|
| Coefficient of variation (CV) | -0.49865712 |
| Kurtosis | 2.7175721 |
| Mean | -10.206067 |
| Median Absolute Deviation (MAD) | 3.095 |
| Skewness | -1.2359834 |
| Sum | -5987613.6 |
| Variance | 25.901258 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -8.026 | 116 | < 0.1% |
| -5.797 | 95 | < 0.1% |
| -4.47 | 95 | < 0.1% |
| -5.584 | 81 | < 0.1% |
| -7.348 | 80 | < 0.1% |
| -6.484 | 79 | < 0.1% |
| -7.031 | 78 | < 0.1% |
| -7.016 | 78 | < 0.1% |
| -8.871 | 78 | < 0.1% |
| -6.651 | 78 | < 0.1% |
| Other values (29186) | 585814 |
| Value | Count | Frequency (%) |
| -60 | 27 | |
| -57.093 | 1 | < 0.1% |
| -55 | 1 | < 0.1% |
| -54.837 | 1 | < 0.1% |
| -54.376 | 1 | < 0.1% |
| -53.986 | 1 | < 0.1% |
| -53.598 | 1 | < 0.1% |
| -51.8 | 1 | < 0.1% |
| -50.174 | 1 | < 0.1% |
| -49.328 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.376 | 1 | |
| 5.109 | 1 | |
| 4.584 | 1 | |
| 4.362 | 1 | |
| 4.11 | 1 | |
| 3.855 | 1 | |
| 3.744 | 1 | |
| 3.575 | 1 | |
| 3.498 | 1 | |
| 3.273 | 1 |
mode
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 586672 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 586672 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 586672 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 586672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 386498 | |
| 0 | 200174 |
speechiness
Real number (ℝ)
| Distinct | 1655 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10486354 |
| Minimum | 0 |
|---|---|
| Maximum | 0.971 |
| Zeros | 329 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0276 |
| Q1 | 0.034 |
| median | 0.0443 |
| Q3 | 0.0763 |
| 95-th percentile | 0.422 |
| Maximum | 0.971 |
| Range | 0.971 |
| Interquartile range (IQR) | 0.0423 |
Descriptive statistics
| Standard deviation | 0.17989279 |
|---|---|
| Coefficient of variation (CV) | 1.7154941 |
| Kurtosis | 13.417449 |
| Mean | 0.10486354 |
| Median Absolute Deviation (MAD) | 0.0133 |
| Skewness | 3.6939506 |
| Sum | 61520.504 |
| Variance | 0.032361417 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.0312 | 2002 | 0.3% |
| 0.033 | 1997 | 0.3% |
| 0.0332 | 1990 | 0.3% |
| 0.0308 | 1990 | 0.3% |
| 0.0324 | 1979 | 0.3% |
| 0.0309 | 1974 | 0.3% |
| 0.0326 | 1973 | 0.3% |
| 0.0319 | 1972 | 0.3% |
| 0.0311 | 1970 | 0.3% |
| 0.0313 | 1962 | 0.3% |
| Other values (1645) | 566863 |
| Value | Count | Frequency (%) |
| 0 | 329 | |
| 0.0216 | 2 | < 0.1% |
| 0.0218 | 2 | < 0.1% |
| 0.022 | 2 | < 0.1% |
| 0.0221 | 6 | < 0.1% |
| 0.0222 | 7 | < 0.1% |
| 0.0223 | 17 | < 0.1% |
| 0.0224 | 10 | < 0.1% |
| 0.0225 | 18 | < 0.1% |
| 0.0226 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.971 | 3 | < 0.1% |
| 0.97 | 7 | < 0.1% |
| 0.969 | 25 | < 0.1% |
| 0.968 | 37 | < 0.1% |
| 0.967 | 41 | < 0.1% |
| 0.966 | 92 | < 0.1% |
| 0.965 | 117 | |
| 0.964 | 161 | |
| 0.963 | 230 | |
| 0.962 | 249 |
acousticness
Real number (ℝ)
| Distinct | 5217 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.44986272 |
| Minimum | 0 |
|---|---|
| Maximum | 0.996 |
| Zeros | 66 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.00177 |
| Q1 | 0.0969 |
| median | 0.422 |
| Q3 | 0.785 |
| 95-th percentile | 0.983 |
| Maximum | 0.996 |
| Range | 0.996 |
| Interquartile range (IQR) | 0.6881 |
Descriptive statistics
| Standard deviation | 0.3488367 |
|---|---|
| Coefficient of variation (CV) | 0.77542922 |
| Kurtosis | -1.4661743 |
| Mean | 0.44986272 |
| Median Absolute Deviation (MAD) | 0.3403 |
| Skewness | 0.15116105 |
| Sum | 263921.86 |
| Variance | 0.12168704 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.995 | 4610 | 0.8% |
| 0.994 | 3574 | 0.6% |
| 0.993 | 2913 | 0.5% |
| 0.992 | 2490 | 0.4% |
| 0.991 | 2320 | 0.4% |
| 0.99 | 2091 | 0.4% |
| 0.989 | 1916 | 0.3% |
| 0.988 | 1685 | 0.3% |
| 0.987 | 1581 | 0.3% |
| 0.996 | 1575 | 0.3% |
| Other values (5207) | 561917 |
| Value | Count | Frequency (%) |
| 0 | 66 | |
| 1 × 10-6 | 1 | < 0.1% |
| 1.01 × 10-6 | 3 | < 0.1% |
| 1.03 × 10-6 | 2 | < 0.1% |
| 1.04 × 10-6 | 2 | < 0.1% |
| 1.05 × 10-6 | 2 | < 0.1% |
| 1.06 × 10-6 | 2 | < 0.1% |
| 1.07 × 10-6 | 3 | < 0.1% |
| 1.08 × 10-6 | 2 | < 0.1% |
| 1.09 × 10-6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.996 | 1575 | 0.3% |
| 0.995 | 4610 | |
| 0.994 | 3574 | |
| 0.993 | 2913 | |
| 0.992 | 2490 | |
| 0.991 | 2320 | |
| 0.99 | 2091 | |
| 0.989 | 1916 | |
| 0.988 | 1685 | 0.3% |
| 0.987 | 1581 | 0.3% |
instrumentalness
Real number (ℝ)
| Distinct | 5402 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.11345078 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 205083 |
| Zeros (%) | 35.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2.45 × 10-5 |
| Q3 | 0.00955 |
| 95-th percentile | 0.874 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.00955 |
Descriptive statistics
| Standard deviation | 0.26686787 |
|---|---|
| Coefficient of variation (CV) | 2.3522788 |
| Kurtosis | 3.5472102 |
| Mean | 0.11345078 |
| Median Absolute Deviation (MAD) | 2.45 × 10-5 |
| Skewness | 2.2703983 |
| Sum | 66558.397 |
| Variance | 0.07121846 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 205083 | |
| 0.911 | 410 | 0.1% |
| 0.904 | 402 | 0.1% |
| 0.916 | 402 | 0.1% |
| 0.905 | 399 | 0.1% |
| 0.917 | 399 | 0.1% |
| 0.901 | 396 | 0.1% |
| 0.912 | 396 | 0.1% |
| 0.888 | 392 | 0.1% |
| 0.897 | 387 | 0.1% |
| Other values (5392) | 378006 |
| Value | Count | Frequency (%) |
| 0 | 205083 | |
| 1 × 10-6 | 140 | < 0.1% |
| 1.01 × 10-6 | 261 | < 0.1% |
| 1.02 × 10-6 | 257 | < 0.1% |
| 1.03 × 10-6 | 272 | < 0.1% |
| 1.04 × 10-6 | 271 | < 0.1% |
| 1.05 × 10-6 | 241 | < 0.1% |
| 1.06 × 10-6 | 231 | < 0.1% |
| 1.07 × 10-6 | 262 | < 0.1% |
| 1.08 × 10-6 | 239 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 22 | |
| 0.999 | 17 | |
| 0.998 | 9 | < 0.1% |
| 0.997 | 15 | |
| 0.996 | 11 | |
| 0.995 | 14 | |
| 0.994 | 18 | |
| 0.993 | 22 | |
| 0.992 | 21 | |
| 0.991 | 23 |
liveness
Real number (ℝ)
| Distinct | 1782 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.21393502 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 43 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0589 |
| Q1 | 0.0983 |
| median | 0.139 |
| Q3 | 0.278 |
| 95-th percentile | 0.653 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1797 |
Descriptive statistics
| Standard deviation | 0.1843256 |
|---|---|
| Coefficient of variation (CV) | 0.8615962 |
| Kurtosis | 4.2887807 |
| Mean | 0.21393502 |
| Median Absolute Deviation (MAD) | 0.058 |
| Skewness | 2.0448023 |
| Sum | 125509.68 |
| Variance | 0.033975926 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.111 | 5579 | 1.0% |
| 0.11 | 5310 | 0.9% |
| 0.109 | 5173 | 0.9% |
| 0.108 | 5162 | 0.9% |
| 0.107 | 4946 | 0.8% |
| 0.112 | 4834 | 0.8% |
| 0.106 | 4788 | 0.8% |
| 0.105 | 4674 | 0.8% |
| 0.104 | 4592 | 0.8% |
| 0.103 | 4442 | 0.8% |
| Other values (1772) | 537172 |
| Value | Count | Frequency (%) |
| 0 | 43 | |
| 0.00572 | 1 | < 0.1% |
| 0.00838 | 1 | < 0.1% |
| 0.00967 | 1 | < 0.1% |
| 0.00986 | 1 | < 0.1% |
| 0.00989 | 1 | < 0.1% |
| 0.0101 | 1 | < 0.1% |
| 0.0108 | 2 | < 0.1% |
| 0.0111 | 2 | < 0.1% |
| 0.0112 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 0.999 | 4 | < 0.1% |
| 0.998 | 4 | < 0.1% |
| 0.997 | 13 | < 0.1% |
| 0.996 | 12 | < 0.1% |
| 0.995 | 17 | < 0.1% |
| 0.994 | 21 | |
| 0.993 | 19 | |
| 0.992 | 33 | |
| 0.991 | 46 |
valence
Real number (ℝ)
| Distinct | 1805 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55229247 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 369 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.121 |
| Q1 | 0.346 |
| median | 0.564 |
| Q3 | 0.769 |
| 95-th percentile | 0.946 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.423 |
Descriptive statistics
| Standard deviation | 0.25767094 |
|---|---|
| Coefficient of variation (CV) | 0.46654798 |
| Kurtosis | -1.0372164 |
| Mean | 0.55229247 |
| Median Absolute Deviation (MAD) | 0.211 |
| Skewness | -0.15230595 |
| Sum | 324014.53 |
| Variance | 0.066394312 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.961 | 2679 | 0.5% |
| 0.962 | 2312 | 0.4% |
| 0.963 | 2023 | 0.3% |
| 0.964 | 1846 | 0.3% |
| 0.96 | 1651 | 0.3% |
| 0.965 | 1599 | 0.3% |
| 0.966 | 1489 | 0.3% |
| 0.967 | 1349 | 0.2% |
| 0.968 | 1155 | 0.2% |
| 0.969 | 948 | 0.2% |
| Other values (1795) | 569621 |
| Value | Count | Frequency (%) |
| 0 | 369 | |
| 1 × 10-5 | 108 | < 0.1% |
| 6.41 × 10-5 | 1 | < 0.1% |
| 0.000183 | 1 | < 0.1% |
| 0.000562 | 1 | < 0.1% |
| 0.000998 | 1 | < 0.1% |
| 0.00123 | 1 | < 0.1% |
| 0.00128 | 1 | < 0.1% |
| 0.00142 | 1 | < 0.1% |
| 0.00155 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 14 | |
| 0.999 | 3 | < 0.1% |
| 0.997 | 5 | < 0.1% |
| 0.996 | 7 | < 0.1% |
| 0.995 | 6 | < 0.1% |
| 0.994 | 12 | |
| 0.993 | 5 | < 0.1% |
| 0.992 | 15 | |
| 0.991 | 19 | |
| 0.99 | 27 |
tempo
Real number (ℝ)
| Distinct | 122706 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118.46486 |
| Minimum | 0 |
|---|---|
| Maximum | 246.381 |
| Zeros | 328 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 75.92355 |
| Q1 | 95.6 |
| median | 117.384 |
| Q3 | 136.321 |
| 95-th percentile | 174.00845 |
| Maximum | 246.381 |
| Range | 246.381 |
| Interquartile range (IQR) | 40.721 |
Descriptive statistics
| Standard deviation | 29.764108 |
|---|---|
| Coefficient of variation (CV) | 0.25124842 |
| Kurtosis | -0.063967333 |
| Mean | 118.46486 |
| Median Absolute Deviation (MAD) | 20.601 |
| Skewness | 0.40326627 |
| Sum | 69500014 |
| Variance | 885.90211 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 328 | 0.1% |
| 128.003 | 98 | < 0.1% |
| 119.994 | 91 | < 0.1% |
| 139.98 | 89 | < 0.1% |
| 127.994 | 86 | < 0.1% |
| 127.997 | 85 | < 0.1% |
| 128.01 | 82 | < 0.1% |
| 119.993 | 82 | < 0.1% |
| 120 | 81 | < 0.1% |
| 127.999 | 81 | < 0.1% |
| Other values (122696) | 585569 |
| Value | Count | Frequency (%) |
| 0 | 328 | |
| 30.506 | 1 | < 0.1% |
| 30.946 | 1 | < 0.1% |
| 31.21 | 1 | < 0.1% |
| 31.262 | 1 | < 0.1% |
| 31.29 | 1 | < 0.1% |
| 31.69 | 1 | < 0.1% |
| 31.988 | 1 | < 0.1% |
| 32.163 | 1 | < 0.1% |
| 32.205 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 246.381 | 1 | |
| 243.759 | 1 | |
| 243.507 | 1 | |
| 243.372 | 1 | |
| 240.782 | 1 | |
| 239.906 | 1 | |
| 238.895 | 1 | |
| 236.799 | 1 | |
| 236.134 | 1 | |
| 233.013 | 1 |
time_signature
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 4 | |
|---|---|
| 3 | |
| 5 | 11400 |
| 1 | 6604 |
| 0 | 337 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 586672 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 5 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 586672 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 586672 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 586672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 503808 | |
| 3 | 64523 | 11.0% |
| 5 | 11400 | 1.9% |
| 1 | 6604 | 1.1% |
| 0 | 337 | 0.1% |
| popularity | duration_ms | danceability | energy | key | loudness | speechiness | acousticness | instrumentalness | liveness | valence | tempo | explicit | mode | time_signature | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| popularity | 1.000 | 0.154 | 0.180 | 0.309 | 0.015 | 0.351 | -0.023 | -0.368 | -0.236 | -0.068 | 0.004 | 0.072 | 0.246 | 0.033 | 0.066 |
| duration_ms | 0.154 | 1.000 | -0.098 | 0.127 | 0.012 | 0.104 | -0.177 | -0.192 | 0.111 | -0.085 | -0.187 | 0.037 | 0.017 | 0.006 | 0.025 |
| danceability | 0.180 | -0.098 | 1.000 | 0.217 | 0.018 | 0.194 | 0.234 | -0.203 | -0.219 | -0.130 | 0.507 | -0.030 | 0.195 | 0.051 | 0.242 |
| energy | 0.309 | 0.127 | 0.217 | 1.000 | 0.036 | 0.771 | 0.167 | -0.718 | -0.126 | 0.079 | 0.360 | 0.238 | 0.138 | 0.068 | 0.148 |
| key | 0.015 | 0.012 | 0.018 | 0.036 | 1.000 | 0.028 | 0.028 | -0.027 | -0.001 | -0.012 | 0.019 | 0.004 | 0.057 | 0.231 | 0.020 |
| loudness | 0.351 | 0.104 | 0.194 | 0.771 | 0.028 | 1.000 | 0.021 | -0.529 | -0.258 | 0.018 | 0.224 | 0.182 | 0.142 | 0.047 | 0.190 |
| speechiness | -0.023 | -0.177 | 0.234 | 0.167 | 0.028 | 0.021 | 1.000 | -0.038 | -0.112 | 0.116 | 0.176 | 0.042 | 0.330 | 0.050 | 0.146 |
| acousticness | -0.368 | -0.192 | -0.203 | -0.718 | -0.027 | -0.529 | -0.038 | 1.000 | 0.111 | 0.018 | -0.155 | -0.217 | 0.151 | 0.059 | 0.131 |
| instrumentalness | -0.236 | 0.111 | -0.219 | -0.126 | -0.001 | -0.258 | -0.112 | 0.111 | 1.000 | -0.067 | -0.143 | -0.012 | 0.069 | 0.013 | 0.033 |
| liveness | -0.068 | -0.085 | -0.130 | 0.079 | -0.012 | 0.018 | 0.116 | 0.018 | -0.067 | 1.000 | -0.020 | -0.018 | 0.019 | 0.014 | 0.041 |
| valence | 0.004 | -0.187 | 0.507 | 0.360 | 0.019 | 0.224 | 0.176 | -0.155 | -0.143 | -0.020 | 1.000 | 0.129 | 0.052 | 0.032 | 0.100 |
| tempo | 0.072 | 0.037 | -0.030 | 0.238 | 0.004 | 0.182 | 0.042 | -0.217 | -0.012 | -0.018 | 0.129 | 1.000 | 0.053 | 0.018 | 0.499 |
| explicit | 0.246 | 0.017 | 0.195 | 0.138 | 0.057 | 0.142 | 0.330 | 0.151 | 0.069 | 0.019 | 0.052 | 0.053 | 1.000 | 0.052 | 0.056 |
| mode | 0.033 | 0.006 | 0.051 | 0.068 | 0.231 | 0.047 | 0.050 | 0.059 | 0.013 | 0.014 | 0.032 | 0.018 | 0.052 | 1.000 | 0.020 |
| time_signature | 0.066 | 0.025 | 0.242 | 0.148 | 0.020 | 0.190 | 0.146 | 0.131 | 0.033 | 0.041 | 0.100 | 0.499 | 0.056 | 0.020 | 1.000 |
| id | name | popularity | duration_ms | explicit | artists | id_artists | release_date | danceability | energy | key | loudness | mode | speechiness | acousticness | instrumentalness | liveness | valence | tempo | time_signature | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 35iwgR4jXetI318WEWsa1Q | Carve | 6 | 126903 | 0 | ['Uli'] | ['45tIt06XoI0Iio4LBEVpls'] | 1922-02-22 | 0.645 | 0.4450 | 0 | -13.338 | 1 | 0.4510 | 0.674 | 0.744000 | 0.1510 | 0.1270 | 104.851 | 3 |
| 1 | 021ht4sdgPcrDgSk7JTbKY | Capítulo 2.16 - Banquero Anarquista | 0 | 98200 | 0 | ['Fernando Pessoa'] | ['14jtPCOoNZwquk5wd9DxrY'] | 1922-06-01 | 0.695 | 0.2630 | 0 | -22.136 | 1 | 0.9570 | 0.797 | 0.000000 | 0.1480 | 0.6550 | 102.009 | 1 |
| 2 | 07A5yehtSnoedViJAZkNnc | Vivo para Quererte - Remasterizado | 0 | 181640 | 0 | ['Ignacio Corsini'] | ['5LiOoJbxVSAMkBS2fUm3X2'] | 1922-03-21 | 0.434 | 0.1770 | 1 | -21.180 | 1 | 0.0512 | 0.994 | 0.021800 | 0.2120 | 0.4570 | 130.418 | 5 |
| 3 | 08FmqUhxtyLTn6pAh6bk45 | El Prisionero - Remasterizado | 0 | 176907 | 0 | ['Ignacio Corsini'] | ['5LiOoJbxVSAMkBS2fUm3X2'] | 1922-03-21 | 0.321 | 0.0946 | 7 | -27.961 | 1 | 0.0504 | 0.995 | 0.918000 | 0.1040 | 0.3970 | 169.980 | 3 |
| 4 | 08y9GfoqCWfOGsKdwojr5e | Lady of the Evening | 0 | 163080 | 0 | ['Dick Haymes'] | ['3BiJGZsyX9sJchTqcSA7Su'] | 1922 | 0.402 | 0.1580 | 3 | -16.900 | 0 | 0.0390 | 0.989 | 0.130000 | 0.3110 | 0.1960 | 103.220 | 4 |
| 5 | 0BRXJHRNGQ3W4v9frnSfhu | Ave Maria | 0 | 178933 | 0 | ['Dick Haymes'] | ['3BiJGZsyX9sJchTqcSA7Su'] | 1922 | 0.227 | 0.2610 | 5 | -12.343 | 1 | 0.0382 | 0.994 | 0.247000 | 0.0977 | 0.0539 | 118.891 | 4 |
| 6 | 0Dd9ImXtAtGwsmsAD69KZT | La Butte Rouge | 0 | 134467 | 0 | ['Francis Marty'] | ['2nuMRGzeJ5jJEKlfS7rZ0W'] | 1922 | 0.510 | 0.3550 | 4 | -12.833 | 1 | 0.1240 | 0.965 | 0.000000 | 0.1550 | 0.7270 | 85.754 | 5 |
| 7 | 0IA0Hju8CAgYfV1hwhidBH | La Java | 0 | 161427 | 0 | ['Mistinguett'] | ['4AxgXfD7ISvJSTObqm4aIE'] | 1922 | 0.563 | 0.1840 | 4 | -13.757 | 1 | 0.0512 | 0.993 | 0.000016 | 0.3250 | 0.6540 | 133.088 | 3 |
| 8 | 0IgI1UCz84pYeVetnl1lGP | Old Fashioned Girl | 0 | 310073 | 0 | ['Greg Fieler'] | ['5nWlsH5RDgFuRAiDeOFVmf'] | 1922 | 0.488 | 0.4750 | 0 | -16.222 | 0 | 0.0399 | 0.620 | 0.006450 | 0.1070 | 0.5440 | 139.952 | 4 |
| 9 | 0JV4iqw2lSKJaHBQZ0e5zK | Martín Fierro - Remasterizado | 0 | 181173 | 0 | ['Ignacio Corsini'] | ['5LiOoJbxVSAMkBS2fUm3X2'] | 1922-03-29 | 0.548 | 0.0391 | 6 | -23.228 | 1 | 0.1530 | 0.996 | 0.933000 | 0.1480 | 0.6120 | 75.595 | 3 |
| id | name | popularity | duration_ms | explicit | artists | id_artists | release_date | danceability | energy | key | loudness | mode | speechiness | acousticness | instrumentalness | liveness | valence | tempo | time_signature | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 586662 | 4Zp3rm12p5PiHToYJflmyy | Meet Again | 57 | 273587 | 0 | ['KIMSEJEONG'] | ['1lFLniFTaPjYCtQZvDXpqu'] | 2020-12-20 | 0.476 | 0.4400 | 3 | -8.508 | 1 | 0.0488 | 0.679 | 0.000000 | 0.0926 | 0.2410 | 135.814 | 4 |
| 586663 | 4ow9HehIdFii1cggylW2k0 | 四季予你 - DJ版 | 47 | 156393 | 0 | ['程響', '阿卓'] | ['7nKA1c1Qn6nI0XA8yburf3', '7g8hOWXtGS16J30CMU1SR7'] | 2020-12-29 | 0.677 | 0.9700 | 0 | -3.388 | 0 | 0.0446 | 0.134 | 0.002340 | 0.3020 | 0.9080 | 140.026 | 4 |
| 586664 | 1Kzjk1EyngBcP4T8x3fyqv | 同行 (新加坡電視劇《愛...沒有距離》主題曲) | 43 | 205238 | 0 | ['Boon Hui Lu'] | ['6PWJWwEm8BSBFAIAUWlwe4'] | 2020-03-03 | 0.743 | 0.6790 | 8 | -3.952 | 1 | 0.0323 | 0.269 | 0.000000 | 0.1330 | 0.3950 | 126.070 | 4 |
| 586665 | 0SjsIzJkZfDU7wlcdklEFR | John Brown's Song | 66 | 185250 | 0 | ['Gregory Oberle'] | ['4MxqhahGRT4BPz1PilXGeu'] | 2020-03-20 | 0.562 | 0.0331 | 1 | -25.551 | 1 | 0.1030 | 0.996 | 0.961000 | 0.1110 | 0.3860 | 63.696 | 3 |
| 586666 | 1ZwZsVZUiyFwIHMNpI3ERt | Skyscraper | 4 | 106002 | 0 | ['Emilie Chin'] | ['4USdOnfLczwUglA3TrdHs2'] | 2020-02-08 | 0.626 | 0.5300 | 5 | -13.117 | 0 | 0.0284 | 0.113 | 0.856000 | 0.1040 | 0.2150 | 120.113 | 4 |
| 586667 | 5rgu12WBIHQtvej2MdHSH0 | 云与海 | 50 | 258267 | 0 | ['阿YueYue'] | ['1QLBXKM5GCpyQQSVMNZqrZ'] | 2020-09-26 | 0.560 | 0.5180 | 0 | -7.471 | 0 | 0.0292 | 0.785 | 0.000000 | 0.0648 | 0.2110 | 131.896 | 4 |
| 586668 | 0NuWgxEp51CutD2pJoF4OM | blind | 72 | 153293 | 0 | ['ROLE MODEL'] | ['1dy5WNgIKQU6ezkpZs4y8z'] | 2020-10-21 | 0.765 | 0.6630 | 0 | -5.223 | 1 | 0.0652 | 0.141 | 0.000297 | 0.0924 | 0.6860 | 150.091 | 4 |
| 586669 | 27Y1N4Q4U3EfDU5Ubw8ws2 | What They'll Say About Us | 70 | 187601 | 0 | ['FINNEAS'] | ['37M5pPGs6V1fchFJSgCguX'] | 2020-09-02 | 0.535 | 0.3140 | 7 | -12.823 | 0 | 0.0408 | 0.895 | 0.000150 | 0.0874 | 0.0663 | 145.095 | 4 |
| 586670 | 45XJsGpFTyzbzeWK8VzR8S | A Day At A Time | 58 | 142003 | 0 | ['Gentle Bones', 'Clara Benin'] | ['4jGPdu95icCKVF31CcFKbS', '5ebPSE9YI5aLeZ1Z2gkqjn'] | 2021-03-05 | 0.696 | 0.6150 | 10 | -6.212 | 1 | 0.0345 | 0.206 | 0.000003 | 0.3050 | 0.4380 | 90.029 | 4 |
| 586671 | 5Ocn6dZ3BJFPWh4ylwFXtn | Mar de Emociones | 38 | 214360 | 0 | ['Afrosound'] | ['0i4Qda0k4nf7jnNHmSNpYv'] | 2015-07-01 | 0.686 | 0.7230 | 6 | -7.067 | 1 | 0.0363 | 0.105 | 0.000000 | 0.2640 | 0.9750 | 112.204 | 4 |